METHOD, SYSTEM, AND PROGRAM FOR
MANAGING A MULTI-PAGE DOCUMENT 

BACKGROUND OF THE INVENTION 
1. Field of the Invention

[0001] The present invention relates to a method, system, and program for managing a 
multi-page document. 

2. Description of the Related Art 

[0002] In publishing, the pages of a document are scanned, transformed into a printer
readable format, and then printed and/or archived in a computer storage device for 
subsequent retrieval and printing. Publishing software is used by publishing companies to 
electronically print and archive books. Pages of a document are scanned to produce an 
electronic bit-map representation of the pixels on the scanned page and the pixel color
values. The page may contain line art, e.g., text and lines, and/or continuous tone, e.g.,
images. For black and white images, the scanned image is stored as a grey scale image 
wherein each pixel represented on the bit map has a corresponding shade of grey value. 
Optical scanners typically allow for anywhere from 16 to 256 shades of grey for each pixel. 
A bit map of a page that allows for grey scale values requires 4 to 8 bits for each pixel to
indicate the different shades of grey. A bit map representing text only, where the pixel is
either black, i.e., "on," or white, i.e., "off," only requires 1 bit per pixel to indicate either
"on" or "off," black or white. A bit map that represents a color page, including line art and 
continuous tone, requires even more space than black and white images. Each pixel of a 
scanned color image can be one of 16.7 million colors, which requires 24 bits for each
pixel.
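
The bit depths above translate directly into storage cost. The following is a minimal sketch of that arithmetic, assuming a hypothetical 8.5 x 11 inch page scanned at 600 dots per inch and rows packed without padding; the page size, resolution, and the function name are illustrative assumptions and do not come from the description.

```python
def bitmap_bytes(width_px, height_px, bits_per_pixel):
    """Uncompressed size in bytes of a bit map (rows packed without padding)."""
    total_bits = width_px * height_px * bits_per_pixel
    return (total_bits + 7) // 8  # round up to whole bytes


# Hypothetical page: 8.5 x 11 inches at 600 dpi (assumed values).
WIDTH, HEIGHT = int(8.5 * 600), 11 * 600

sizes = {
    "bi-level (1 bit)": bitmap_bytes(WIDTH, HEIGHT, 1),
    "grey scale (8 bits)": bitmap_bytes(WIDTH, HEIGHT, 8),
    "color (24 bits)": bitmap_bytes(WIDTH, HEIGHT, 24),
}

for label, size in sizes.items():
    print(f"{label}: {size / 2**20:.1f} MiB")
```

Under these assumptions the color bit map is 24 times the size of the bi-level bit map, which is why the choice of representation matters when archiving an entire book.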

[0003] After the page is scanned, the bit map produced by the scanner must be further 
processed into a bit map format that the printer can process. The transformation of the
scanned bit map into a printer readable format is referred to as halftoning or raster image
processing (RIP). In a publishing environment, halftoning or transforming, i.e., RIPping, 
numerous bit maps that represent the pages of a large document, such as a book, can take 
several hours. The result of the halftone transformation is a bi-level image, where each bit
map value indicates either black or white. The halftoning transformation would use
screening and dithering algorithms to produce a wide range of shades of grey or colors.
Dithering refers to the illusion of printing colors or shades of grey by varying the patterns of 
dots. Thus, a bit map where each bit map value can correspond to one of many values 
indicating different shades of grey or colors is converted to a bi-level bit map where each
value indicates one of two colors -- black or white. However, the halftoning or
transformation process uses dithering to simulate the shades of grey or different colors.
The term "gray scale" as used herein refers to a bit map where each bit map value can be
one of numerous values, indicating different colors or shades of grey. The term "halftone" as
used herein will refer to the bit map, transformed from the grey scale bit map, that can be
processed by the printer.
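
The conversion from a grey scale bit map to a bi-level bit map can be illustrated with ordered dithering. This is only a sketch of one well-known dithering technique (a 4x4 Bayer threshold matrix); the description does not say which screening or dithering algorithm is used, and the function name and the convention that 0 is black and 255 is white are assumptions made for the example.

```python
# 4x4 Bayer threshold matrix (a standard ordered-dither pattern).
BAYER_4X4 = [
    [0, 8, 2, 10],
    [12, 4, 14, 6],
    [3, 11, 1, 9],
    [15, 7, 13, 5],
]


def halftone(gray, levels=256):
    """Convert a grey scale bit map to a bi-level bit map.

    gray: rows of ints, 0 (black) .. levels - 1 (white).
    Returns rows of 0/1 values, where 1 means the dot is "on" (black).
    """
    out = []
    for y, row in enumerate(gray):
        out_row = []
        for x, value in enumerate(row):
            # The threshold varies with position, so a uniform grey area
            # becomes a pattern of dots rather than solid black or white.
            threshold = (BAYER_4X4[y % 4][x % 4] + 0.5) * levels / 16
            out_row.append(1 if value < threshold else 0)
        out.append(out_row)
    return out


mid_grey = [[128] * 4 for _ in range(4)]
dots = halftone(mid_grey)
for row in dots:
    print("".join("#" if bit else "." for bit in row))
```

A uniform mid-grey area comes out with half of its dots on, which is the "illusion" of grey that dithering produces on a device that can only print black or white.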

[0004] A halftone bitmap may be a bi-level bit map where there is only one value for 
each pixel, i.e., 0 or 1. Additionally, contone printers print a limited number of shades of a 
color or grey. Thus, a halftone bit map for a contone printer may include a limited number 
of values for each bit map value and also use dithering to simulate additional shades of grey
and color. The term "halftoning" as used herein refers to the process of
transforming a gray scale bit map to a printer readable format, such as a bi-level bit map or
contone bit map capable of a limited number of different values for each pixel that are 
available on the printer. Halftone bit maps are typically specific to a particular printer, i.e., 
device dependent. 

[0005] The gray scale or halftoned bit map of each page in the book may be stored in
one or more files. For instance, the tagged image file format (TIFF)** stores one bit map
for each page in a single file. A TIFF file can store both the gray scale bit map and a
halftoned bitmap. 

[0006] The gray scale bit map is referred to as device independent because it represents 
the image without reference to a specific device. The halftone bit map is device dependent 
as the gray scale bit map was transformed, e.g., screened or halftoned, into a halftone bit
map that is particularly suited to the dithering capabilities of the printer on which the halftone 
bit map will be printed. 

[0007] In publishing environments, to reprint copies of a book, a person will manually 
scan each page of a book. If there are images on a page, then the person must use a
software tool to manually select the image section and then screen the
image section to produce a halftone bitmap of the image. This process of digitizing a
document or book to publish is very tedious, time consuming and requires substantial labor 
costs. 

[0008] Moreover, once the halftone is produced for one specific output device, such as a 
particular printer having specific characteristics, outputting that halftoned image on other
printer types may produce unpredictable and undesirable output, such as moire effects. To 
provide optimal image reproduction, the publisher would have to rescreen the image on the 
page to produce a new halftone for other printers on which the document will be 
reproduced. 

[0009] For all the above reasons, there is a need in the art to provide an improved
technique for scanning and publishing books or documents in a digital printing environment.

SUMMARY OF THE PREFERRED EMBODIMENTS
[0010] Provided is a method, system, and program for managing a multi-page document. 
A plurality of input files are received including digital representations of multiple pages of a
document. The content of each entire page is represented in a first representation format. 
The entire page content is capable of including content of a first content type and second
content type. For each page of the document represented in the input files, a determination
is made of regions in the page including content of the second content type. Each 
determined region in the page is processed to generate the content for each region in a 
second representation format. The content of the entire page in the first representation 
format and the content of each region in the second representation format are added to at
least one output file. The output file is then stored. 

[0011] Further, the first content style may comprise line art and the second content style 
may comprise a continuous tone image. 

[0012] Yet further, the first representation format may comprise a device dependent 
image format and the second representation format may comprise a device independent
image format. The device dependent format may comprise a halftone image format and the 
device independent format may comprise a grey scale image format. 
[0013] In additional implementations, output files are selected for rendering. For each 
page represented in the selected output files, the content of the entire page in the first 
representation format and the content for each region in the page in the second
representation format are accessed. The content for each region in the page in the second
representation format is processed to generate the content for each region in the first
representation format. The content for the entire page and the content for each region in
the first representation format are merged into page output in the first representation format 
and rendered into a human observable format.

[0014] Still further, the multi-page document may be managed within a network 
publishing system to archive the document for later rendering on one of multiple network 
printing devices. 

[0015] The described implementations provide an improved technique for managing 
digital pages of a multi-page document in a manner that is especially useful when wanting to
save the multi-page document in a format that may be rendered on multiple output devices, 
such as different printer types. The described implementations manage the digital pages so
that the output of the files sent to different printing devices is not reproduced with
undesirable output effects, such as moire effects. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0016] Referring now to the drawings in which like reference numbers represent
corresponding parts throughout: 

FIG. 1 is a block diagram illustrating a computing environment in which certain 
described implementations of the present invention are implemented; 

FIG. 2 illustrates an image file format data structure in accordance with the 
described implementations of the present invention;

FIG. 3 illustrates logic to generate an image file in accordance with the described 
implementations of the present invention; and 

FIGs. 4a, b illustrate logic to process an image file to print a document in 
accordance with preferred embodiments of the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0017] In the following description, reference is made to the accompanying drawings 
which form a part hereof and which illustrate several embodiments of the present invention. 
It is understood that other embodiments may be utilized and structural and operational
changes may be made without departing from the scope of the present invention.
[0018] FIG. 1 illustrates a network printing environment 2 in which preferred 
embodiments are implemented. A workstation 4, printer server 6, storage system 8, 
scanner 10, and printer 12 are all interconnected via a network 14. The scanner 10, which
may comprise any optical scanning device known in the art, receives printed pages 16a, b,
c including text and/or images and converts the printed pages 16a, b, c into digitized
representations 16'a, b, c of the pages 16a, b, c in a gray scale or halftone format. If the
pages 16'a, b, c are in a gray scale format, then each page would be represented by a
bitmap data structure having one of multiple values for each bit value, e.g., 24, to represent
different shades of grey, or different colors in the case of color printers. Additionally, the
scanner 10 may halftone the gray scale representation of the pages 16a, b, c when
producing the output pages 16'a, b, c. In such case, the output pages 16'a, b, c would be
optimized for a specific printer. The scanned output pages 16'a, b, c may be stored in the
storage system 8. 

[0019] The storage system 8 includes digitized pages 20a, b, c including text and/or
images in either gray scale format, where each bit represents one of multiple values, or a
halftoned image that is screened and optimized for a particular printing device. The
digitized pages 20a, b, c may have been previously scanned pages or pages received from
an alternative source. 

[0020] The workstation 4 comprises any computing device known in the art, e.g., 
desktop, laptop, mainframe, server, hand held computer, etc., and includes a publishing 
tool 18. The publishing tool 18 is a software program that executes within the workstation
4. The publishing tool 18 receives digitized images comprising gray scale and/or halftoned
bit maps of pages of text and/or images from the scanner, e.g., pages 16'a, b, c, or from the
storage 8, e.g., pages 20a, b, c, and generates output pages 22a, b, c that comprise files
including digital representations of the text and/or images on each input page in accordance
with the described implementations of the invention. The output pages 22a, b, c may
include digital representations of one or more pages of a document or book, and may be
stored in the storage system 8 for later retrieval and printing. 
[0021] The printer server 6 is capable of accessing the image files 20a, b, c and 
performing any user selected processing on such files, such as RIPping or halftoning a gray 
scale bit map or touch-up halftoning operations on a halftone bit map. The printer server 6
may be a separate stand alone server including an operating system such as AIX, OS/390,
UNIX, OS/2, MVS, WINDOWS, etc.** For instance, the printer server 6 may be
implemented on the IBM RS/6000 server and include the AIX operating system and the
IBM INFOPRINT Manager software.** The storage system 8 may be any suitable
non-volatile storage device, such as a hard disk drive, a direct access storage device (DASD),
which is comprised of interconnected hard disk drives, a tape storage device, automated 
tape library (ATL), optical disk library, holographic units, etc. 
[0022] The printer 12 may be any suitable printer machine known in the art, including
printers capable of handling large scale publishing jobs, such as printing an entire book. 
The network 14 may be comprised of any suitable network architecture known in the art, 
such as LAN, Ethernet, WAN, Storage Area Network (SAN), Token Ring, LocalTalk, 
TCP/IP, the Internet, Fibre Channel, etc. Alternatively, there may be separate and different
networks between the devices 4, 6, 8, 10, and 12.

[0023] The publishing tool 18, which includes the program code for processing input page files
16'a, b, c, 20a, b, c and generating pages 22a, b, c, generates output pages 20a, b, c in a
TIFF file implementation illustrated in FIG. 2 that stores halftone and/or gray scale bit
maps of the page. A TIFF file contains one or more image file directories (IFD) that
include information on a bit map representing a page in a document. In current art, a TIFF
file maintains an IFD for each bit map representation of a page, either halftoned or gray 
scale, in a document included in the TIFF. A TIFF file may include a bit map for one or 
more pages from a document. Details of the TIFF format are described in the publication
"TIFF, Revision 6," (Copyright Adobe Systems, Inc., 1986-88, 1992), which publication is
incorporated herein by reference in its entirety.

[0024] In the described implementations, the TIFF file 50 may maintain multiple bit maps 
for each page in a document or, alternatively one or more bit maps for only one page of the 
multi-page document. For instance, with respect to FIG. 2, IFD0 may maintain
information on a halftone bit map in the TIFF file for a page of a multi-page document,
where the halftone may include line art and/or continuous tone. The TIFF file 50 may
further include one or more gray scale bit maps for the image regions within the page,
wherein IFD1 would include information on a gray scale bit map of a first image in the page.
Information on (j - 1) additional gray scale bit maps in the TIFF file 50 for further images
on the page may be maintained in IFD2 to IFDj. In this way, the TIFF file implementation
maintains a halftone and IFD for each page, including a halftone of both line art and any
continuous tone images, and additional associated IFDs and gray scale bit maps for each
image region within the page.

[0025] FIG. 2 illustrates how IFDs, text, and images are maintained in a TIFF file 50 in 
accordance with the described implementations of the present invention. In FIG. 2, the 
TIFF file 50 maintains bit maps for one page. An image file header 52 includes a pointer or 
offset 54 to the first IFD0 56, including directory entries 58 on the location of a halftone of
the contents, i.e., line art and/or continuous tone, of the first page in the TIFF file 50. The
IFDs of the preferred embodiments include a private tag 60 which indicates whether any
gray scale bit maps are maintained for the page and a pointer to the IFD1 for the first gray
scale bit map for the page. As discussed, one or more gray scale bit maps may be
provided for each page if the page contains one or more images. A private tag indicates
information allocated in a private number range that is registered with a TIFF administrator
and is meaningful to the particular organization defining the private tags. In preferred
embodiments, the private tag is used to indicate IFDs for one or more gray scale bit maps
for a particular page, and a pointer to the IFD for the gray scale bit map.
[0026] In FIG. 2, the private tag 60 in the first IFD0 points to IFD1 62, which includes
directory entry 64 information on a first gray scale bit map of a continuous tone image
within the first page. If there are additional gray scale bit maps maintained for the first page,
then IFD1 62 would include, as shown in FIG. 2, an offset 66 to IFD2 68 including
information on a second gray scale bit map of a continuous tone section of the page.
Additional IFDs may be maintained for any additional gray scale images maintained for the
page. The final jth IFDj 70 would include a value of zero for the offset 72, indicating that
there are no further gray scale images maintained for the page.

[0027] In the described implementations, a TIFF file may maintain, for each page, a
halftone image of the entire contents of one page, including line art and continuous tone, and
gray scale bit maps for each image within a page. This allows a single TIFF to maintain
device dependent information, i.e., the halftone bit map, and device independent
information, the gray scale bit maps, for each page. Such TIFF files may be archived pages
of a book or larger document maintained in the storage system 8. In the above described 
implementations, only one page of a multi-page document is maintained in each TIFF file, 
thus requiring multiple TIFF files for a multi-page document. In additional implementations, 
a single TIFF file may maintain halftone and/or gray scale bitmaps for multiple pages in the 
manner described in the commonly assigned and co-pending patent application entitled
"Image File Format for Storing Graphical Matter and Related Information", having U.S. 
Application Serial No. 09/323,163 and filed on May 27, 1999, which patent application is 
incorporated herein by reference in its entirety. In implementations where a single TIFF file 
maintains the bitmaps for multiple pages, the base IFD for the first page includes offset 
information indicating the location of the next base IFD for the halftone of the second page.
Each subsequent page may include pointers to further bitmaps for each page and a pointer
in the base IFD to the IFD of the next page. Each IFD further includes orientation
information indicating the positioning within the page of the image represented by the bit 
map associated with the IFD. 
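
The IFD chain of FIG. 2 can be modeled as a small linked structure. The sketch below is an in-memory illustration only and assumes hypothetical field names (`private_tag`, `next_offset`, `position`) and offsets; it does not read or write real TIFF files, and the actual private tag number is not given in the description.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class IFD:
    kind: str                            # "halftone" or "grayscale"
    bitmap_offset: int                   # location of the bit map in the file
    position: Tuple[int, int] = (0, 0)   # orientation of the region within the page
    private_tag: int = 0                 # IFD0 only: offset of IFD1, 0 if no images
    next_offset: int = 0                 # IFD1..IFDj: offset of next IFD, 0 ends the chain


def grayscale_ifds(ifds_by_offset: Dict[int, IFD], first_ifd_offset: int) -> List[IFD]:
    """Follow the private tag of IFD0, then the offsets, until a zero offset."""
    base = ifds_by_offset[first_ifd_offset]
    chain = []
    offset = base.private_tag
    while offset != 0:
        ifd = ifds_by_offset[offset]
        chain.append(ifd)
        offset = ifd.next_offset
    return chain


# Hypothetical one-page file: a halftone of the page plus two image regions.
ifds = {
    8: IFD("halftone", 1000, private_tag=100),
    100: IFD("grayscale", 5000, (10, 20), next_offset=200),
    200: IFD("grayscale", 9000, (300, 40)),
}
chain = grayscale_ifds(ifds, 8)
print([ifd.bitmap_offset for ifd in chain])
```

A page containing only text or line art would carry a private tag of zero in IFD0, so the same traversal returns an empty list and only the halftone is used.
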
[0028] FIG. 3 illustrates logic implemented in the publishing tool 18 to process input
pages in a halftone or gray scale format to produce output TIFF files providing
representations of the document pages. Control begins at block 100 with a user invoking
the publishing tool 18. The publishing tool 18 receives (at block 102) user selection of N
input files comprising halftoned pages of a document, from the scanner (pages 16'a, b, c) or
storage 8 (pages 20a, b, c), that are halftoned and optimized for a specific printer. As discussed, using
another printer type to render the halftone generated for a different printer may yield 
undesirable output results, such as moire effects. The user may specifically designate the
input pages of the document. Alternatively, an index or document ticket may be provided
that provides an ordered list of the halftone representations of each page of the document.
The publishing tool 18 then performs the loop at blocks 104 to 136 for each user
specified page i of the document.

[0029] The publishing tool 18 processes page i and performs (at block
106) a segmentation operation to determine the image and text areas of page i. The
publishing tool 18 may use any technique known in the art to segment the image and text 
regions of the page, such as a segmentation technique based on gradient information. 
Details of implementations for the segmentation algorithm utilized by the publishing tool 18
are described in the co-pending and commonly assigned patent application entitled
"Method and Apparatus for Repurposing Binary Images", having U.S. Application Serial
No. 09/110,900, filed on July 6, 1998, which patent application is incorporated herein by
reference in its entirety. The gradient segmentation technique determines the halftone areas
as those regions containing more transitions between black and white values than text areas.
Thus, the image sections may be deemed those having a higher gradient activity per unit
area.
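
The idea of "gradient activity per unit area" can be sketched by counting black/white transitions in fixed-size tiles of a bi-level bit map. This is a simplified stand-in, not the algorithm of the referenced application; the tile size, the threshold value, and the function names are assumptions chosen for illustration.

```python
def transitions(tile):
    """Count horizontal and vertical black/white changes inside a tile."""
    count = 0
    for row in tile:
        count += sum(1 for a, b in zip(row, row[1:]) if a != b)
    for upper, lower in zip(tile, tile[1:]):
        count += sum(1 for a, b in zip(upper, lower) if a != b)
    return count


def classify_tiles(bitmap, tile=8, threshold=0.5):
    """Label each tile 'image' or 'text' by transitions per pixel.

    threshold is a hypothetical tuning value, not taken from the source.
    """
    labels = []
    for y in range(0, len(bitmap), tile):
        row_labels = []
        for x in range(0, len(bitmap[0]), tile):
            block = [r[x:x + tile] for r in bitmap[y:y + tile]]
            area = sum(len(r) for r in block)
            activity = transitions(block) / area
            row_labels.append("image" if activity > threshold else "text")
        labels.append(row_labels)
    return labels


# Left tile: a halftone-like checkerboard. Right tile: one thick vertical stroke.
bitmap = [
    [(x + y) % 2 if x < 8 else int(x in (11, 12)) for x in range(16)]
    for y in range(8)
]
print(classify_tiles(bitmap))
```

The screened area flips between black and white at almost every pixel, while a text stroke changes value only at its edges, so the two separate cleanly on this toy input.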

[0030] The publishing tool 18 then determines (at block 108) the m bounded areas of
each image area on page i. The publishing tool 18 may utilize a clustering technique, such
as that described in the patent application "Method and Apparatus for Repurposing Binary
Images", having U.S. Application Serial No. 09/110,900, incorporated by reference
above, that determines rectangular or other shaped areas of the image areas. Those
regions outside of the image area comprise text.
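
One simple way to obtain rectangular bounded areas is to group adjacent image tiles and take the bounding rectangle of each group. The sketch below uses a plain flood fill over a grid of tile labels as a stand-in for the clustering technique of the referenced application, which is not reproduced here; the label grid and function name are hypothetical.

```python
def image_bounding_boxes(labels):
    """Return (top, left, bottom, right) tile coordinates for each
    4-connected group of 'image' tiles."""
    rows, cols = len(labels), len(labels[0])
    seen = set()
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if labels[r][c] != "image" or (r, c) in seen:
                continue
            stack = [(r, c)]
            seen.add((r, c))
            top, left, bottom, right = r, c, r, c
            while stack:
                y, x = stack.pop()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and labels[ny][nx] == "image"
                            and (ny, nx) not in seen):
                        seen.add((ny, nx))
                        stack.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


tile_labels = [
    ["image", "image", "text", "text"],
    ["image", "image", "text", "image"],
    ["text", "text", "text", "image"],
]
print(image_bounding_boxes(tile_labels))
```

Each rectangle found this way corresponds to one of the m image areas on the page; everything outside the rectangles is treated as text.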

[0031] The publishing tool 18 creates (at block 112) IFD0 for a TIFF file
to include information on the received halftone of the entire page i and its location in the
TIFF file. If there are images on the page (at block 114), then the publishing tool 18 sets
(at block 118) the private tag in IFD0 to include an offset to IFD1 for the first image on
page i. If there are no images on page i, then the private tag in IFD0 is set (at block 116) to
zero.

[0032] If there are images in page i, then from block 118, the publishing tool 18
proceeds to perform the loop beginning at block 120 for each image j in page i, where j
extends from 0 to m. The publishing tool 18 accesses image j on page i and then
descreens (at block 122) the halftone of the image j to generate a grey scale or
intermediate grey scale representation of the image in a manner known in the art. For
instance, the publishing tool 18 may utilize the descreening process described in the patent
application "Method and Apparatus for Repurposing Binary Images", having U.S.
Application Serial No. 09/110,900, incorporated by reference above, which discloses a
technique for descreening screened halftone images. Such descreening techniques may
involve the use of a boxcar filter or Gaussian blur. Further details of the segmentation and
descreening processes for halftone images are also described in the publication entitled
"Descreening Printer-Ready Images", by A. R. Rao, Frederick Mintzer, Gerhard
Thompson, IS&T's NIP 14: 1998 International Conference on Digital Printing
Technologies, pgs. 285-289 (Copyright 1998), which publication is incorporated herein by
reference in its entirety. The output of the descreening process is an intermediate grey
scale image that does not contain any halftone screen. After generating the descreened
gray scale representation of image section j, the publishing tool 18 adds (at block 124)
information on image j in IFDj, including the location of the gray scale bit map for image j in
the TIFF file and orientation information on where to orient image j within page i. The
mixed format TIFF file providing information to reproduce page i may then be stored (at
block 134) in storage 8. If (at block 126) there is another image on page i, then the
publishing tool 18 adds (at block 130) offset information pointing to IFDj+1 for the next
image and then proceeds back to block 120 to perform another iteration of the loop
for the next image. If there are no further images, then the publishing tool 18 sets (at block
128) the offset information to zero and proceeds (at block 136) to process the next page (i + 1)
to generate the mixed format TIFF file for the next page of the document.
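
Descreening with a boxcar filter amounts to averaging the bi-level values in a small window around each pixel. The following is a minimal sketch of that one technique, assuming a square window clipped at the region edges, 1 meaning a black dot, and an output range of 0 (black) to 255 (white); the window radius and function name are illustrative assumptions, and the referenced descreening process may differ.

```python
def descreen_boxcar(halftone, radius=2, levels=256):
    """Estimate a grey scale bit map from a bi-level halftone.

    halftone: rows of 0/1 values, where 1 is a black dot.
    Returns rows of ints, 0 (black) .. levels - 1 (white).
    """
    height, width = len(halftone), len(halftone[0])
    gray = []
    for y in range(height):
        row = []
        for x in range(width):
            black = total = 0
            # Average over a (2 * radius + 1) square window, clipped at the edges.
            for ny in range(max(0, y - radius), min(height, y + radius + 1)):
                for nx in range(max(0, x - radius), min(width, x + radius + 1)):
                    black += halftone[ny][nx]
                    total += 1
            row.append(round((1 - black / total) * (levels - 1)))
        gray.append(row)
    return gray


# A 50% screen (checkerboard) should come back as roughly mid grey.
screen = [[(x + y) % 2 for x in range(8)] for y in range(8)]
estimate = descreen_boxcar(screen)
print(min(map(min, estimate)), max(map(max, estimate)))
```

The averaging removes the dot pattern itself, leaving only the local dot density, which is what allows the region to be halftoned again later for a different printer.
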
[0033] In this way, a TIFF file including a halftone bit map provides a halftone of the 
entire image that is suitable for generating the text on different printers. The TIFF file 
further includes gray scale bit maps for each image on a page to allow for use with different
printers in a manner that produces desirable output. The TIFF files for each page of the
document may then be later retrieved and printed on any type of printer device because the 
image sections may be halftoned and rendered directly from the gray scale representation of 
the image. 

[0034] FIGs. 4a, b illustrate logic implemented within the publishing tool 18 to process a
request to access and print the mixed format TIFF files, generated according to the logic of
FIG. 3. The printing process to print a document archived in the storage system 8 in TIFF
files 20a, b, c begins at block 200 in FIG. 4a. The publishing tool 18, through a graphical
user interface (GUI), allows the user to select a print quality for the print job, such as high
or low. The low quality may be used to proof the job to observe the general layout of the
printed document. High quality is used for the final print job. The publishing tool 18
determines (at block 202) the quality the user selected, i.e., high or low. The publishing
tool 18 then determines (at block 204) the TIFF files including pages for the selected
document. As discussed, halftone and gray scale bit maps representing each page in a
document may be archived in multiple TIFF files 20a, b, c.

[0035] The publishing tool 18 then proceeds to begin a loop (at block 206) for each
TIFF file 20a, b, c of the selected document. If (at block 210) the user did not select a
high print quality, then the publishing tool 18 accesses (at block 212) the halftone bit map
described in IFD0 and sends the halftone bit map to the printer 12 for printing. Thus, the
halftone bit map is used to print the page for the selected TIFF, regardless of whether the
TIFF includes both text and image(s), i.e., continuous tone. From block 212, the printer
server proceeds to block 216 to process any further TIFF files including page data for the
document by performing another iteration of the loop beginning at block 206. If the print
quality is high, then the publishing tool 18 determines (at block 214) whether the private tag
for IFD0 has an offset pointing to one or more IFDs providing information on gray scale bit
maps associated with the TIFF file for image sections of the page. If there are no gray
scale bit maps for image regions in the page, then the publishing tool 18 sends (at block
212) the halftone bit map indicated in IFD0 to the printer 12 for printing. In this case, the
page represented in the selected TIFF file does not include any continuous tone, so that the
halftone bit map can produce high quality line art, such as text.

[0036] If the private tag points to a chain of one or more IFDs beginning at IFD1, then
the publishing tool 18 begins a loop (at block 218, in FIG. 4b) for each image j associated
with the page. As part of this loop, the publishing tool 18 accesses (at block
220) the gray scale representation of image j using IFDj and halftones the gray scale bit
map for image j using any halftone technique known in the art, such as halftoning with a
dithering mask. The gray scale bit map would be halftoned to optimize the output for the
printer 12 selected to print the document represented in the TIFF files 20a, b, c. The
publishing tool 18 then merges (at block 222) the halftone bit map generated for image j
into the halftone for page i, described in IFD0. The publishing tool 18 would use the
orientation information maintained in the IFDj for image j to merge the halftone bit map of
the image j into the correct location in page i. After merging the halftone for image j into
the halftone for the page, the publishing tool 18 determines (at block 224) whether the
offset for IFDj indicates a further image (j + 1) for the current page. If so, the publishing
tool 18 proceeds (at block 226) to perform another iteration of the loop beginning at block
218 to halftone the next image (j + 1) and merge such halftone into the current halftone for
the page. Thus, the halftone for the page may have multiple levels of halftone bit maps merged
therein. If (at block 224) there are no further images for the page to merge into the
combined halftone for the page, then the publishing tool 18 sends (at block 228) the
generated combined halftone bit map for the page, which may include multiple levels of
merged halftones, to the printer 12 for printing. The publishing tool 18 then proceeds (at
block 232) back to block 206 to process any further TIFF files including additional pages
for the selected document.
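
The print-time decision of FIGs. 4a, b can be summarized in a few lines. The sketch below assumes a hypothetical in-memory page record (a page halftone plus a list of gray scale regions with positions) and uses a plain threshold as a placeholder for the printer-specific halftoning step; the names `page_output`, `merge`, and `threshold_halftone` are illustrative and not taken from the description.

```python
def threshold_halftone(gray, levels=256):
    """Placeholder for a printer-specific halftone; 1 means a black dot."""
    return [[1 if v < levels // 2 else 0 for v in row] for row in gray]


def merge(page_bits, region_bits, top, left):
    """Copy a region halftone into a copy of the page halftone."""
    merged = [row[:] for row in page_bits]
    for dy, row in enumerate(region_bits):
        for dx, bit in enumerate(row):
            merged[top + dy][left + dx] = bit
    return merged


def page_output(page, quality, halftone_fn=threshold_halftone):
    """Return the bi-level bit map to send to the printer for one page."""
    out = page["halftone"]
    # Low quality, or a page with no image regions: use the stored halftone.
    if quality != "high" or not page["regions"]:
        return out
    # High quality: re-halftone each gray scale region and merge it in place.
    for region in page["regions"]:
        bits = halftone_fn(region["gray"])
        top, left = region["position"]
        out = merge(out, bits, top, left)
    return out


page = {
    "halftone": [[0] * 4 for _ in range(4)],
    "regions": [{"gray": [[0, 255], [255, 0]], "position": (1, 1)}],
}
proof = page_output(page, "low")
final = page_output(page, "high")
print(final)
```

Passing a different `halftone_fn` for each target printer is the point of keeping the gray scale regions: the stored page halftone is reused for line art, and only the image regions are regenerated.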

[0037] The described implementations provide a technique for managing the input files for
a print job that minimizes the use of human labor because the processing, segmentation, and
rendering of the pages is handled by the publishing tool 18 automatically. Further,
performance and output is optimized, because the mixed format TIFF files 20a, b, c may be
used to produce quality output on different printers as the image portions of the pages are
maintained as gray scale bit maps. In this way, the TIFF file format includes both device
dependent data, such as the halftone of a page, and device independent data, such as the
gray scale of the continuous tone images in the page. If the user desires to print at a low
quality for proofing purposes and speed, then the publishing tool 18 will transmit the already
transformed halftone bit map representations of the pages, including line art and/or
continuous tone, to the printer 12 for printing. This process will maximize printing
performance and minimize use of processor cycles as the publishing tool does not have to
transform, e.g., halftone or RIP, any gray scale bit maps for the image regions. However, if
a high quality print job is selected, then the publishing tool will use the device independent
data and transform the gray scale bit maps to halftones and merge the regenerated halftone
bit maps into the halftone of the page to allow for high quality rendering of the continuous
tones within the page.

[0038] With these described implementations, publishers do not have to manually rescan 
pages in the document including continuous tone as the device independent gray scale bit 
map of the images is maintained in the TIFF file format of the preferred embodiments. 
Further, the publishing tool automatically handles the image and text sections of a page to
provide optimal output. Although maintaining gray scale representations of the images
increases processing time, with the described implementations, both the quality of the output
and processing time are optimized because only those pages having continuous tone are
descreened and stored as a gray scale bitmap that must be rasterized later. For those
pages including only text or line art, the saved halftone is sent to the printer without having 
to perform the costly transformation processes. A halftone of line art or text alone usually 
can print at high quality on any printer. Thus, use of processor cycles to transform gray 
5 scales is minimized by halftoning only those pages that need to be halftoned again to ensure 
the selected high quality of print output. 

[0039] Further, different compression techniques may be used for different components
in the TIFF file, depending on whether the DDF indicates the component to be a halftone or
gray scale bit map. The gray scale bit maps in a TIFF file may be compressed using
compression algorithms particularly suited for compressing continuous tone, such as JPEG
or another lossy algorithm. The halftone images in the TIFF file may be compressed using
LZW or another lossless algorithm. This ensures that compression for both the halftone and
gray scale components is maximized, thereby minimizing the storage space needed to archive
the pages. The preferred embodiment TIFF file further minimizes use of storage space
because gray scales are only maintained for those pages that include continuous tone
images; gray scales are not maintained for pages that only include text or line art. This
minimizes the number of gray scale bit maps archived, and hence minimizes storage space
as gray scale bit maps require substantially more storage space than halftoned bi-level bit
maps.
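
As a minimal sketch, assuming the component type is available as a simple label, the
per-component choice of compression and the difference in raw storage may be expressed as
follows. The function names choose_compression and raw_bytes, and the labels "halftone" and
"grayscale", are hypothetical; the functions only name a compression family and compute
uncompressed sizes, and do not perform any compression.

```python
def choose_compression(component_kind: str) -> str:
    """Name a compression family for one component of the archived page."""
    if component_kind == "halftone":
        return "LZW"   # lossless, so the bi-level data is recovered exactly
    if component_kind == "grayscale":
        return "JPEG"  # lossy, suited to continuous tone
    raise ValueError(f"unknown component kind: {component_kind!r}")


def raw_bytes(width_px: int, height_px: int, bits_per_pixel: int) -> int:
    """Uncompressed bit map size, with each row padded to a whole byte (an assumption)."""
    bytes_per_row = (width_px * bits_per_pixel + 7) // 8
    return bytes_per_row * height_px
```

For bit maps of equal dimensions, an 8-bit gray scale bit map occupies about eight times the
raw space of a 1-bit halftone, which is why limiting gray scale bit maps to pages with
continuous tone images reduces the archive size.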

[0040] Following are some alternative implementations for accomplishing the present
invention. 

[0041] The described implementations may be implemented as a method, apparatus or
article of manufacture using standard programming and/or engineering techniques to
produce software, firmware, hardware, or any combination thereof. The term "article of
manufacture" as used herein refers to code or logic implemented in hardware logic (e.g., an
integrated circuit chip, Field Programmable Gate Array (FPGA), Application Specific
Integrated Circuit (ASIC), etc.) or a computer readable medium (e.g., magnetic storage
medium (e.g., hard disk drives, floppy disks, tape, etc.), optical storage (CD-ROMs,
optical disks, etc.), volatile and non-volatile memory devices (e.g., EEPROMs, ROMs,
PROMs, RAMs, DRAMs, SRAMs, firmware, programmable logic, etc.)). Code in the
computer readable medium is accessed and executed by a processor. The code in which
preferred embodiments are implemented may further be accessible through transmission
media or from a file server over a network. In such cases, the article of manufacture in
which the code is implemented may comprise transmission media, such as a network
transmission line, wireless transmission media, signals propagating through space, radio
waves, infrared signals, etc. Of course, those skilled in the art will recognize that many
modifications may be made to this configuration without departing from the scope of the
present invention, and that the article of manufacture may comprise any information bearing
medium known in the art.

[0042] The printer server 6, storage system 8, workstation 4, and printer 12 may be
implemented in many types of computing environments, such as a network printing
environment where the network 14 provides communication between the devices 4, 8, 12.
Alternatively, the printer server 6 and storage system 8 may be included in a single desktop
workstation, such as a personal computer or workstation, that is connected to the printer
12 via a parallel port. Numerous other configurations are also possible.

[0043] The printer 12 may be a contone printer or bilevel printer. A contone printer uses
a combination of dithering and printing at different levels of intensity to produce different
colors and different shades of lightness and darkness. Unlike a true continuous-tone
printer, contone printers can dispense toner at only a few different levels of intensity. To
produce the full range of colors that can be perceived by the human eye, they must also use
dithering techniques.
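
The combination of a few intensity levels with dithering may be illustrated by the following
sketch, which is offered only as an assumption about one possible approach. Ordered
dithering with a 2x2 Bayer matrix is substituted here because the described implementations
do not name a particular dithering technique; the function name dither_to_levels and the
default of four levels are hypothetical.

```python
from typing import List

# 2x2 Bayer index matrix; an illustrative choice, not taken from the description.
BAYER_2X2 = [[0, 2],
             [3, 1]]


def dither_to_levels(intensity: List[List[int]], levels: int = 4) -> List[List[int]]:
    """Map requested ink intensities (0..255) onto printable levels 0..levels-1."""
    step = 255 / (levels - 1)  # intensity spanned by one printable level
    out = []
    for y, row in enumerate(intensity):
        out_row = []
        for x, value in enumerate(row):
            base = int(value // step)          # nearest printable level at or below
            fraction = value / step - base     # how far toward the next level
            threshold = (BAYER_2X2[y % 2][x % 2] + 0.5) / 4
            level = base + (1 if fraction > threshold else 0)
            out_row.append(min(level, levels - 1))
        out.append(out_row)
    return out
```

With this approach a uniform intermediate intensity is rendered as a mix of two adjacent
printable levels whose local average approximates the requested value.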

[0044] In the logic described with respect to FIG. 3, the input files included halftoned
screened digitized images. In alternative implementations, the received input files may
comprise gray scale images in a device independent format. In such case, the publishing
tool 18 could halftone the entire page to allow for reproduction of text and line graphics,
and then maintain the gray scale representations of image sections within the page to save
for later halftoning for a particular printer. This described process reduces the space
needed to store the data for the page and, at the same time, allows for high quality
reproduction on all printers because only the image sections are stored in the gray scale
format, which requires substantially more space than the halftoned image.
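
A minimal sketch of this alternative, assuming the positions of the image sections are
already known, is given below. Ordered dithering with a 4x4 Bayer matrix is used purely as a
stand-in halftoning technique, since the description does not prescribe one; the names
ordered_dither and archive_page and the (top, left, height, width) box layout are
hypothetical.

```python
from typing import List, Tuple

Bitmap = List[List[int]]
Box = Tuple[int, int, int, int]  # (top, left, height, width), a hypothetical layout

# 4x4 Bayer index matrix; an illustrative choice of screen.
BAYER_4X4 = [
    [0, 8, 2, 10],
    [12, 4, 14, 6],
    [3, 11, 1, 9],
    [15, 7, 13, 5],
]


def ordered_dither(gray: Bitmap) -> Bitmap:
    """Halftone a gray scale bit map (0 black to 255 white) to bi-level, 1 = ink."""
    out = []
    for y, row in enumerate(gray):
        out_row = []
        for x, value in enumerate(row):
            threshold = (BAYER_4X4[y % 4][x % 4] + 0.5) * 16  # ranges over 8..248
            out_row.append(1 if value < threshold else 0)
        out.append(out_row)
    return out


def archive_page(gray_page: Bitmap, image_boxes: List[Box]):
    """Halftone the whole page and keep gray scale only for the image sections."""
    halftone = ordered_dither(gray_page)
    kept = []
    for top, left, height, width in image_boxes:
        crop = [row[left:left + width] for row in gray_page[top:top + height]]
        kept.append((top, left, crop))
    return halftone, kept
```

Only the cropped image sections are retained at full gray scale depth, so the stored data
grows with the area of the images rather than with the area of the page.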

[0045] The described implementations were discussed with respect to line art and
continuous tone graphics data types. However, the documents processed may include any
type of digital information for rendering graphics matter, such as text, line art, continuous
tone, images, etc.

[0046] The implementations were described with respect to graphics input that 
comprised pages of a document or book. However, the preferred image file formats may 
be used to maintain data for any type of graphics input including information on graphics 
matter, regardless of the format in which the graphics matter is stored, i.e., pages, etc. 

[0047] In the described implementations, the pages the publishing tool receives as input
are stored in a file per page. Alternatively, the input files received may include multiple
pages of a document. Further, the output in the TIFF file format described with respect to
FIG. 2 may include the content for one or more pages. Still further, the number of output
files including the document pages may differ from the number of input files including the
document pages in the device dependent file format if pages are combined in the output files
in a manner that is different than how the pages are dispersed throughout the input files.

[0048] The above discussed implementations were described with respect to producing
an output file for printing. However, those skilled in the art will appreciate that the
preferred embodiments may be used to produce graphics output for any type of output
device, including printers, plotters, display monitors, lithography machines, three
dimensional modeling machines, etc.

[0049] The discussed implementations describe storing in the TIFF file either gray scale
bit maps, which include both color and shades of gray, or halftone bit maps. Those
skilled in the art will appreciate that the file format of the preferred embodiments may store
print image files in any type of file format, such as vector formats, PostScript™**, other bit
mapped formats, and any other graphics file format known in the art, e.g., CGM, DXF,
EPS, GEM, HPGL, IGES, PIC, PICT, WMF, etc.

[0050] The described implementations of the device independent images utilized a gray
scale format and the device dependent images utilized a bi-level format. In the described
implementations, the device independent format typically had more possible bit values per
pixel than the device dependent format. However, in alternative embodiments, the device
dependent format may have the same number or more possible bit values per pixel as the
device independent format. Moreover, the device dependent and device independent
formats may represent the image in different color spaces. For instance, for color images,
the device independent format may be maintained in a CIE-Lab color space format, and
the device dependent format may represent an image in a Cyan-Magenta-Yellow-Black
(CMYK) color space that is a transformation of the CIE-Lab device independent image to
the CMYK device dependent color space supported by the printer.

[0051] In the described implementations, the device dependent and device independent
data structures providing representations of printed matter on a page are maintained in a
single file, e.g., a single TIFF file. In further embodiments, the device
dependent and device independent data structures may be maintained in multiple files,
wherein the files include information to relate the separate files and device dependent and
device independent data structures therein to accomplish the purposes of this invention.

[0052] The TIFF implementation is described as providing input for a printer. However,
those skilled in the art will appreciate that in alternative implementations the image file
format may maintain graphic information for any output device, including any type of
printer, display monitor, or any other display medium capable of displaying graphical
images.

[0053] The described implementations maintained multiple types of print image file
formats in a file in the TIFF format. Additionally, alternative file formats known in the art
may be used to provide both device dependent and device independent print image files in
the same file. Preferably, the device independent files are used to store print matter that
needs to be transformed for the specific printer in order to print at a high quality, such as
continuous tone images, and the device dependent files are used to store print matter that
does not need to be transformed for each specific printer to print at a high quality, such as
text or line art. One advantage of this file format is that the number of device independent
files maintained in the print file, which typically require more space than the device
dependent files, is minimized as the print file only includes device independent files for print
matter, i.e., the image sections, that must be transformed for a specific printer to print at a
high quality. Further, device independent files typically require significantly more
processing, which degrades printer performance, than device dependent files before they
are ready for printing. Thus, the preferred file formats further maximize printer performance
by minimizing the processing cycles used to transform device independent print files. Print
matter that does not need to be completely transformed in order to print at a high quality on
different printers is maintained in device dependent files, which require substantially less
transformation processing than device independent files.

[0054] The foregoing description of the preferred embodiments of the invention has been
presented for the purposes of illustration and description. It is not intended to be exhaustive
or to limit the invention to the precise form disclosed. Many modifications and variations
are possible in light of the above teaching. It is intended that the scope of the invention be
limited not by this detailed description, but rather by the claims appended hereto. The
above specification, examples and data provide a complete description of the manufacture
and use of the composition of the invention. Since many embodiments of the invention can
be made without departing from the spirit and scope of the invention, the invention resides
in the claims hereinafter appended.



**TIFF and PostScript are trademarks of Adobe Systems Inc.; IBM, AIX, OS/2, and
RS/6000 are registered trademarks of IBM, and OS/390, INFOPRINT, and MVS are
trademarks of IBM; WINDOWS is a registered trademark of Microsoft Corporation;
UNIX is a registered trademark licensed by the X/Open Company LTD.



